AMENDMENT TO THE CLAIMS 



1 . (Currently amended) A method of adapting an n-gram language model for a new 
domain, the method comprising: 

receiving background data indicative of general text phrases not directed to the 
new domain; 

receiving a set of semantic entities used in the new domain and organized in 
classes; 

generating background n-gram class count data based on the background data and 

the semantic entities and classes thereof; 
receiving adaptation data indicative of text phrases used in the new domain; 
generating adaptation n-gram class count data based on the adaptation data and 

the semantic entities and classes thereof; 
training a language model based on the background n-gram class count dat a and 

the adaptation n-gram class count data ; and 
embodying the language model in tangible form. 

2. (Cancelled) 

3. (Currently amended) The method of claim 2- 1 and further comprising: 

generating background n-gram word data based on the background n-gram class 
count data and the semantic entities and classes thereof; 

generating adaptation n-gram word data based on the adaptation n-gram class 
count data and the semantic entities and classes thereof; and 

wherein training the language model based on the background n-gram class count 
data and the adaptation n-gram class count data comprises using 
background n-gram word data and adaptation n-gram word data. 

4. (Original) The method of claim 3 wherein generating background n-gram word data 
comprises generating background n-gram word data for multi-word semantic entities with 
each data entry comprising a selected number of words. 



5. (Original) The method of claim 4 wherein generating adaptation n-gram word data 
comprises generating adaptation n-gram word data for multi-word semantic entities with 
each data entry comprising a selected number of words. 

6. (Original) The method of claim 4 wherein generating background n-gram class count 
data based on the background data and the semantic entities and classes thereof 
comprises tagging word level background data based on the semantic entities and classes 
thereof. 

7. (Original) The method of claim 5 wherein generating adaptation n-gram class count 
data based on the adaptation data and the semantic entities and classes thereof comprises 
tagging word level adaptation data based on the semantic entities and classes thereof. 

8. (Original) The method of claim 6 wherein generating background n-gram class count 
data based on the background data and the semantic entities and classes thereof 
comprises counting unique class level n-grams of the tagged background data. 

9. (Original) The method of claim 7 wherein generating adaptation n-gram class count 
data based on the adaptation data and the semantic entities and classes thereof comprises 
counting unique class level n-grams of the tagged adaptation data. 

10. (Original) The method of claim 8 wherein generating background n-gram class count 
data based on the background data and the semantic entities and classes thereof 
comprises discarding some class n-grams from the tagged background data. 

11. (Original) The method of claim 9 wherein generating adaptation n-gram class count 
data based on the adaptation data and the semantic entities and class thereof comprises 
discarding some class n-grams from the tagged adaptation data. 



12. (Currently amended) A computer-readable storage medium having computer- 
executable instructions for performing steps to generate a language model, the steps 
comprising: 

receiving a set of semantic entities used in a selected domain and organized in 
classes; 

receiving background n-grams class count data correlated to classes of the set of 
semantic entities and based on background data indicative of general text; 

receiving adaptation n-gram class count data correlated to classes of the set of 
semantic entities and based on adaptation data indicative of a selected 
domain to be modeled; and 

training a language model based on the background n-gram class count data, the 
adaptation n-gram class count data and the set of semantic entities ^; and 

wherein training the language model comprises computing background word 

count data based on the background n-gram class count data and the set of 
semantic entities, computing adaptation word count data based on the 
adaptation n-gram class count data and the set of semantic entities, and 
smoothing the n-gram relative frequencies. 

13-15. (Cancelled) 

16. (Currently amended) The computer-readable storage medium of claim 45- 12 wherein 
smoothing comprises using a deleted-interpolation algorithm. 

17. (New) The method of claim 3 and further comprising: 

wherein generating background n-gram word data comprises generating 

background n-gram word data for multi-word semantic entities with each 

data entry comprising a selected number of words; 
wherein generating adaptation n-gram word data comprises generating adaptation 

n-gram word data for multi-word semantic entities with each data entry 

comprising a selected number of words; 



wherein generating background n-gram class count data based on the background 
data and the semantic entities and classes thereof comprises tagging word 
level background data based on the semantic entities and classes thereof; 
and 

wherein generating adaptation n»gram class count data based on the adaptation 
data and the semantic entities and classes thereof comprises tagging word 
level adaptation data based on the semantic entities and classes thereof. 



